Here I will clean the 2019_subway_rider_data
Now I will specifically clean the data within the columns
Cleaned rider frequency table:converting complaints into long format:
Making some of the variables as a character and some as a factor!<<<<<<< HEAD Converting zip codes into boroughs. I used https://www.nyc.gov/assets/planning/download/office/data-maps/nyc-population/census2000/sf1p11.xls to get majority of the borough’s zip codes. Any remaining ones, I looked up on Google. There were three zip codes that I could not find a match for, and were likely miswritten, so I removed those data rows.
Create complaint_data_time from rider_data_full with time information